Genre tagging of videos based on information retrieval and semantic similarity using WordNet

نویسندگان

  • José Manuel Perea Ortega
  • Arturo Montejo Ráez
  • Manuel Carlos Díaz-Galiano
  • Maria Teresa Martín-Valdivia
چکیده

In this paper we propose a new approach for the genre tagging task of videos, using only their ASR transcripts and associated metadata. This new approach is based on calculating the semantic similarity between the nouns detected in the video transcripts and a bag of nouns generated from WordNet, for each category proposed to classify the videos. Specifically, we have used the Lin measure based on WordNet, which calculates the semantic distance between two synsets. Obviously, this approach has been only applied on the English test videos due to the use of WordNet, an English lexical resource. As base case, we have applied an information retrieval system as a classifier, using the generated bag of nouns for each category as index data and the ASR transcripts from each test video as query. Several experiments have been submitted, one of them combining both approaches (information retrieval and semantic similarity). As main conclusion we have shown that, using this combination of semantic similarity and information retrieval, we can improve the results obtained using the information retrieval approach only.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

متن کامل

UAB at MediaEval 2011: Genre Tagging Task

We describe our approach and results towards the genre tagging task of MediaEval 2011. We approached this as an Information Retrieval task and applied a pseudo relevance feedback (PRF) approach for query expansion. Query expansion was also done using WordNet and Wikipedia

متن کامل

TUD-MIR at MediaEval 2011 Genre Tagging Task: Query expansion from a limited number of labeled videos

In this paper we present results of our initial research on genre tagging. We approach the task from information retrieval perspective using a relatively small number of labeled videos in the development set to mine query expansion terms characteristic of each genre. We also investigate which sources of information associated with the videos or extracted from their audio channel, e.g. title, de...

متن کامل

Information Retrieval by Semantic Similarity

Semantic Similarity relates to computing the similarity between conceptually similar but not necessarily lexically similar terms. Typically, semantic similarity is computed by mapping terms to an ontology and by examining their relationships in that ontology. We investigate approaches to computing the semantic similarity between natural language terms (using WordNet as the underlying reference ...

متن کامل

Semantic Retrieval Approach for Web Documents

Because of explosive growth of resources in the internet, the information retrieval technology has become particularly important. However the current retrieval methods are essentially based on the full text matching of keywords approach lacking of semantic information and can’t understand the user's query intent very well. These methods return a large number of irrelevant information, and are u...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011